Model Selection

Reinforcement Learning Inference Optimization

# Reinforcement Learning Inference Optimization

Acereason Nemotron 14B

AceReason-Nemotron-14B is a math and code reasoning model trained through reinforcement learning, based on DeepSeek-R1-Distilled-Qwen-14B, excelling in math and code reasoning tasks.

Large Language Model

A small-scale large language model enhanced by reinforcement learning, focused on improving the reasoning capabilities of a 1.5B parameter model

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase